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ABSTRACT 

A comparison of the usage of English articles and 
prepositions by native speakers of Finnish and of Finland-Swedish 
studied the word frequencies of these grammatical categories in three 
levels of English essays written by school-leavers in 
Finnish-language and Swedish-language schools in Finland. It was 
hypothesized that the poorest essays written by Finnish-speakers 
would show lower frequencies in the use of English articles and of 
prepositions where simple one-to-one correspondences are difficult to 
establish. The frequencies of these words were expected to rise 
somewhat in intermediate essays and to rise to almost native-speaker 
level in the good essays. Because Finland-Swedish is grammatically 
closer to English, the figures for Finland-Swedish learners were 
expected to be higher ihan those of the Finns at the lower levels, 
but the figures for the best essays were not expected to differ very 
much. One hundred fifty essays on three levels of proficiency written 
by Finnish- speaking and the same number written by Swedish-speaking 
students were chosen to test the hypothesis. The data supported that 
hypothesis. It is concluded that the pattern of learning articles and 
prepositions in English where there are no easily established Finnish 
equivalents represents an avoidance by the students of linguistic 
features that are absent in the first language and perhaps considered 
redundant by the learner at early stages. The process should be 
considered a lack of transfer rather than indirect negative transfer 
of knowledge from the first language to the second. (MSE) 
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UJ WORD FREQUENCIES IN FINNISH AND FINLAND- 
SWEDISH LEARNER LANGUAGE 

Vocabulary learning starts with the learning of high-frequency words, but 
in order to understand these high-frequency words it is essential to have a 
knowledge of the functions of the basic grammatical categories to which 
these words belong. If the learner can rely on at least rough cross- 
linguistic correspondences between the basic grammatical categories, it 
will be relatively easy for him to establish simplified equivalences between 
individual lexical items. When such equivalences can be established for 
high-frequency words this will help the learner considerably in his learn- 
ing, whereas, for instance, low-frequency loanwords, formally similar to 
equivalent LI -words obviously are of much less help. A workable refer- 
ence frame in the LI is especially important for the early stages of learning 
(cf. Ringbom 1985). 

Grammatical categories not found in the learner's LI pose especially 
great learning problems. In English, the use of the articles has been men- 
tioned by several error analysts as a particularly difficult area to learn for 
learners whose LI does not have articles. Thus Oiler A Redding 
found that there was a clear difference between two groups of learners: 
"GI (students whose native languages have formal equivalents) performed 
better on the test of article usage than G2 (students whose native language 
did not have equivalents....) The differences... were statistically signifi- 
cant" (1971:90 f.). The 'earning problems for Czech learners was men- 
^ tioned by Duikova ( 1969). and for Finnish learners there are several inves- 
ligations arriving at the same result (Granfors & Palmberg ( 1976). Herra- 
U nen (1978). Ringbom (1978a). Sajavaara (1983), and Ekman-Laine 
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In particular, Finnish learners, especially ai the early and intermediate 
stages of learning, tend to omit the articles where they should have been 
included. This obviously reflects a Finn's problems of finding a reference 
frame for this category. Since function words like articles have only 
grammatical meaning, not lexical meaning, it is easy for a Finnish learner 
to perceive articles simply as redundant (cf. George 1972:13 f.), paying 
little or no attention to them in reception, and neglecting them in 
production. 

When Finnish school-leavers write their matriculation examination 
they have normally read English for ten years. Having been a national 
examiner in English for about the same length of time, I have constantly 
been struck by the enormous differences in quality between the candidates 
taking part in this compulsory examination, part of which consists of an 
essay. Although the best of these essays are very native-like, some of the 
bottom candidates produce English that is hardly comprehensible, even to 
one who knows the writer's LI. The impression one gets of these is of a 
kind of reduced code, where especially articles and prepositions are fre- 
quently omitted. 

A frequency count of the most commonly occurring words in Finnish 
and Finland-Swedish learner language to verify such subjective im- 
pressions may provide an interesting supplement to error analysis, especial- 
ly when these frequencies are placed beside comparable native speaker 
figures. The main hypothesis to be tested is, then, that the poorest Finnish 
essays will show lower frequencies in the use of English articles and of 
such prepositions where simple one-to-one correspondences are especially 
difficult to establish. The frequencies of these words will rise in intermedi- 
ate essays and rise even further, to about native speaker level, in the good 
essays, the figures for Finland-Swedish learners are expected to be higher 
than those of the Finns at the low level, but the figures for the best essays 
are not expected to differ very much either from the corresponding Fin- 
nish figures or from the native speaker norm. 

In order to test this hypothesis, word frequencies for the most com- 
monly used words were counted. On the basis of the marks awarded by the 
examiner, 300 essays, ISO written by candidates from Finnish-language 
schools on ten different topics and the same number from Swedish- 
language schools on IS different topics, were selected, representing three 
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different levels of proficiency. The 50 good essays from each language 
group had been awarded a mark of 85 or more out of 99 possible, the 50 
iniermediate ones were those with marks between 60 and 75 points, and 
the 50 poor essays had received 50 or less points, i.e. they had either failed 
(the limit for an accepted essay being 48) or were very close to failure. 

The average number of words used in these essays was 201. the total 
number of words being 29.076 for the Finns and 30.288 for the Swedes 
In fact, it proved difficult to And a sufficient number of the poor 
category among Swedish speakers. Since much fewer candidates from 
Swedish schools fail the examination.' the upper limit of the worst Swe- 
dish essays was raised to 58. 

To have only one native-speaker norm would probably have been in- 
sufficient and therefore three native-speaker norms were selected as well. 

The lowest level consisted of 50 essays written by 15-year-old pupils at 
Hookergate Comprehensive School. Rowlands Gill, Eneland. These es- 
says were all on the same topic. My School, and their average number of 
words was 390. the total number of words used 19.540. 

The second group was made up of 50 American undergraduates 
studying in their first year at Purdue University. Indiana. Their essays 
were on 27 different topics and their average number of words was 706, 
making up a total of 35.309 words. 

The third native-speaker norm is that of the Brown Corpus, Section A, 
Press Reportage, and the figures are taken from Ame Zettersten's Word- 
frequency List ( 1978). Each of the 44 samples of this professional writing 
contained about 2.000 words and the total number of words was 88.753. 

The difference in length and the varying number of topics might at first 
hand seem obstacles to really worthwhile comparisons. However, if 
should be made clear that this study is only concerned with function 
words, not content words. Its focus is not content, but language, and the 
most frequently recurring aspects of language at that. 

The fact that number five on the frequency list of the English school 
ch.ldren's essays is the v ord school is uninteresting in this context, since it 
merely reflects the fact that all essays in this group deal with school. What 
is interesting is to see possible differences in frequencies of function words 
both within and across the three groups, since this reflects how these 
different groups use language, not what they write about. 
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The native-speakers that are, in all respects, most comparable to ma* 
triculation examination candidates in Finland as far as general maturity is 
concerned are the American students, and here the differences in the size 
and type of samples is negligible if the three proficiency levels are merged 
into one Finnish and one Swedish category. There may, however, be 
certain interesting variations in the native-speaker norms (especially 
school children — university students — professional writers) that may 
possibly be mirrored in the different Finnish and Finland-Swedish profi- 
ciency levels. 

Let us first compare the mean frequencies per 100 words of the most 
commonly used words 2 for the American students with the total number 
of Finnish and Finland-Swedish (hereafter Swedish) student*. 



TABLE 1 





American 








students 


All Finns 


AU Swedes 


the 


5.4 


4.4 


4.8 


to 


3.3 


3.0 


17 


tod 


14 


17 


16 


a 


15 


1.9 


3.4 


of 


16 


10 


11 


to 


1.7 


1.9 


10 


is 


1.7 


1.6 


It 


it 


0.9 


1.5 


1.5 


be 


0.7 


at 


1.6 


that 


1.2 


1.6 


1.3 


have 


0.5 


1.3 


1.4 


are 


0.6 


0.9 


OS 


for 


0.8 


0.7 


0.9 


but 


0.3 


0.9 


0.7 



Table I shows that there seem to be some general differences between 
learner language and native language. American students make more 
frequent use of the definite article than Swedes or (especially) Finns, and 
also the frequency of the word of is higher — no doubt mainly because of 
more frequent use of the o/-genitive. On the other hand, both Finnish and 
Swedish learner language uses the words if. have and but more frequently. 
The high frequency of but may be assumed to reflect a general overuse of 
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the most common clauj* connecters, with corresponding infrequent use or 
no use at all of other clause connecters, whereas native speakers would 
probably use a wider variety of clause connecters generally. Without de- 
tailed studies no immediate explanation can be suggested for the high 
frequencies of it and have in both Finnish and Swedish learner language, 
or for the Swedish high frequency of be.* 

These differences, however, tend to disappear, or are at any rate con- 
siderabiy diminished if we compare the American students only with the 
best of the Finnish and Swedish candidates. This is true particularly of the 
frequencies for the definite and indefinite articles and for of. as can be seen 
from Table II. 



TABLE II 





American 








students 


Good Finns 


Good Swedes 


the 


5.4 


6.0 


5.3 


to 


3.3 


3.7 


3.0 


tnd 


2.4 


2.7 


2.3 


t 


2.5 


2.7 


3.1 


of 


2.6 


2.6 


2.8 


in 


1.7 


1.7 


1.7 


is 


1.7 


1.7 


1.8 


it 


0.9 


1.2 


1.3 


be 


0.7 


1.2 


2.0 


that 


1.2 


I.I 


1.2 


have 


0.5 


1.0 


1.2 


are 


0.6 


0.8 


0.7 


for 


0.8 


08 


0.8 


but 


0.3 


0.7 


0.8 



The examiner's impression of near-native language in these good Fin- 
nish and Swedish essays is thus reflected in the similar number of occur- 
rences of high-frequency words. This table shows much smaller differences 
than table III, where, above all, fluencies of the good group differ very 
much from those of the poor group, with the intermediate group usually 
occupying an intermediate position. 
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Poor 



the 

IMS 




ft nH 


? 7 

X. 1 


tn 
W 




111 


1th 


It 


I.I 


it 


1.7 


that 


1.7 


have 


1.6 


of 


1.6 


a 


1.3 


was 


1.0 


axe 


1.0 


but 


at 


as 


0.7 


not 


0.7 



TABLE III 



Finns 
In termed. 



the 


3.9 


and 




to 


2,7 


in 


iO 


that 


1.9 


of 


1.7 


a 


1.6 


is 


1.5 


it 


1.5 


have 


1.4 


was 


1.2 


but 


I.I 


are 


0.8 


for 


O.t 


at 


0.7 



Good 



the 


6.0 


to 


3.7 


and 


2.7 


a 


2.7 


of 


2.6 


in 


1.7 


is 


1.7 


it 


1.2 


be 


2.2 


that 


I.I 


have 


1.0 


are 


0.8 


for 


0.8 


but 


0.7 


has 


0.6 



Swedes 

Poor Intermed. Good 



the 


4.3 


the 


4.6 


the 


5.3 


a 


3.3 


a 


4.0 


a 


3.1 


and 


16 


and 


2.9 


to 


3.0 


to 


11 


to 


2.9 


of 


2.8 


in 


1.9 


in 


2.4 


and 


13 


is 


1.8 


of 


1.8 


be 


2.0 


have 


1.8 


it 


1.8 


is 


1.8 


of 


1.7 


is 


1.7 


in 


1.7 


be 


IS 


be 


1.3 


it 


1.3 


it 


I.S 


have 


1.3 


that 


1.2 


that 


1.5 


that 


1.2 


have 


1.2 


are 


HO 


would 


I.I 


as 


0.9 


when 


0.9 


for 


1.0 


but 


0.8 


would 


0.7 


with 


0.7 


for 


0.8 


but 


0.6 


but 


0.7 


are 


0.7 
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Native bpcaitcrs 



School child 


rcn 


Students 




Brown Corpus 


the 


4.9 


the 


54 


the 


7 A 


to 


3 3 


to 


3.3 


of 


i i 


and 


3.1 


of 


26 




L.J 


in 


1.9 


a 


2.5 






of 


1.9 


and 


2.4 


to 


1 s 


was 


1.8 


IS 


17 


in 


1 \ 


a 


1.6 


in 


1.7 


for 


t I 
1 ■ • 


it 


I.I 


that 


1.2 


that 


1.0 


when 


1.0 


tt 


0.9 


is 


0.9 


at 


0.9 


for 


08 


was 


0.8 


for 


0.7 


was 


0.7 


on 


0.8 


had 


0.7 


be 


0.7 


at 


0.7 


have 


0.7 


as 


0.7 


with 


0.7 


but 


0.7 


this 


0.7 


be 


0.6 


that 


0.6 


arc 


0.6 


as 


0.6 



Table HI verifies the hypothesis of the Finns* use of articles. In their 
article frequencies the best Finns do not differ much either from the 
Swedes or the Americans, but there is a steadily rising curve from the poor 
to the good students* with the end points being quite far from one another, 
as can be seen from Table IV and Diagrams 1-4. 



TABLE IV 



Finnish Swedish NS 





Poor 


Interm.Good 


Poor 


Interm.Good 


School Univ. 


BC 


The 


3.2 


3.9 6.0 


4.3 


4.6 S.3 


4.9 S.4 


7.4 


A 


1.3 


1.6 2.7 


3.3 


4.0 3.1 


1.6 2.S 


2.5 


An 


0.1 1 


0.14 0.34 


0.3S 


0.33 0.28 


0.09 0.42 


0.36 


Total 


4.6 


5.7 9.0 


7.9 


8.9 8.7 


6.6 8.3 


10.2 



0 
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frequency of an 



Diagram 4 
total frequency of 
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In addition to articles, English prepositions make up another main 
problem area for Finnish learners, since this is a word class that hardly 
exists in Finnish, and correspondences to Finnish cases are much less nat- 
urally established than to, for instance, the prepositions in Swedish How- 
ever, not all prepositions pose the same problems for the Finnish learner. 
For certain prepositions, notably in and with, Finnish equivalent* (the 
inessive case for in and the postposition kanssa for with) can be established 
that will work in most cases. Those English prepositions, on the other 
hand, which are so multifunctional that a Finnish correspondence is not 
easily established are, above all, of, on and by, for which a very wide 
variety of Finnish constructions will be used. With these prepositions one 
might therefore expect the same picture to emerge as for the articles: that 
the Finns at the low and intermediate proficiency level use these words 
much less frequently than either the American students or the correspond- 
ing Swedish learners, whereas the figures would be by and large similar for 
ill and with. 

TABLE v 







Finnish 




Swedish 






NS 






Poor 


Intcrm.Good 


Poor 


Inc. 


Good 


School Univ. 


BC 


In 


2.0 


10 


1.7 


1.9 


2.4 


1.7 


1.9 


1.7 


13 


With 


0.4 


0.5 


0.3 


0.S 


0.7 


0.S 


0.S 


0.6 


0.7 


Of 


1.5 


1.7 


2.6 


1.7 


t.t 


11 


1.9 


16 


3.3 


On 


0.2 


0.3 


0.4 


0.4 


0.S 


0.S 


0.6 


0.6 


0.7 


By 


0.16 


0.19 


0.31 


0.06 


0.IS 


0.26 


0.11 


0.3 


0.6 
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Diagram 3 

frequency of m 



0.8 

0.6 ■ 
0.4 ■• 

0.2 «■ 



poor interned, good 



4 " 

3 ■■ 

2 •• 

1 




poor interned, good 



0.8 
0.6 

0.4 ■ 

0.2 " 
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Dwgram 6 
frequency of mj/A 



Diagram 7 
frequency of 
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frequency of o* 
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0.8 



0.6 




Diagram 9 

frequency of by 



0.4 



0.2 



poor inttrnad • good 

On the whole, the figures in Table V conform to the expectation. It is, 
however, dangerous to overemphasize the contrastive linguistic aspects 
here, since the table also reveals other differences. In several respects, 
especially the poor and intermediate levels of both Finnish and Swedish 
learners have frequencies very similar to those of the English schoolchild- 
ren. This is true particularly of the word of where sparse use of the 
o/-genitive must obviously be a main reason. The same is true of the word 
by, where the use of the agentive by will provide at least a partial explana- 
tion. Qf-genitives and especially passives tend to be used primarily by 
writers of a certain linguistic sophistication: i.e., native speakers at univer- 
sity < M and the really good foreign learners. Further investigation of the 
context is, however, needed here, since we need to know also whether the 
words are correctly used or not, but the tendency is the same as that of 
Danish learners. According to Faerch (1981) in was much more frequent 
than of in Danish schoolchildren, but less frequent than of in the English 
of university students reading English. It is not so much the decreasing use 
of in (26.5% vs 21.4% of all prepositions), as the increasing use of of 
(14.8% vs 27.1%) that causes the difference. Similarly, the use of by 
increases from 0.2% of all prepositions used by Danish schoolchildren to 
5.4% of the prepositions used by university students. It is thus easy to 
agree with Faerch and discern the same pattern in both Finnish and 
Finland-Swedish learner language when he states (1981: IS): "Even with- 
out a supplementary qualitative analysis of the functions of these prep- 
ositions I think it is safe to interpret these results as primarily indicating 
the growth of syntactic structures (^/-genitives and passives).** 
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The difference, between the frequencies of Finnish and Finland- 
Swedish learners are particularly illuminating about the Finnish learners. 
Above all. they pinpoint the differences between successful learners and 
unsuccessful learners who have had the same amount of training. There is 
a clear progression in the use of such English forms without easily estab- 
lishable equivalents in Finnish: from a very low frequency in the poor 
Finnish candidates, who obviously tend to omit these words when they 
should have been used, to near-native frequency in the good candidates. It 
is important to note this avoidance of ihe use of linguistic features absent 
in the LI and therefore easily perceived as redundant at the earliest stages 
of learning and perhaps as merely •difficult* at subsequent stages of learn- 
ing. It is a clear indication of the important role of the mother tongue in 
such learners as are very far from native-like proficiency, in spite of having 
go. quite extensive training. The term (negative) transfer or interference 
would, however, not be a very happy term to use in this context, since it is 
difficult to see exactly what has been transferred. Instead of classifying this 
kind of avoidance as a case of indirect negative transfer or interference 
(Dagut & Laufer 1982). we should take it as an example of lack of (posi- 
tive) transfer, since it is. in fact, the absence of a reference frame that has 
caused this avoidance. This tallies well with Schachter's (1974) findings 
that Chinese learners of English avoid producing relative clauses, since 
they find them quite difficult, not having such construction in their LI. 4 



NOTES 

I No e»act figures arc available here to support this statement, but the following table for the 
last eleven years shows the mean score of candidates from F.nn.sh and Swedish schools in 
•wo other parts of the fcngl.sh examination, a l.slening comprehension test and a reading 
comprehension test: 
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TABLE VI 

LC (max. 30 p) RC (max. 30 p) 





Finnish 


Swedish 


Finnish 


Swedish 


1974 


19.7 


22.4 


24.1 


25.7 


1975 


21.6 


24.S 


22.8 


24.7 


1976 


18.5 


214 


18.6 


22.0 


1977 


IIS 


23.2 


23.4 


25.3 


I97S 


22.1 


25.2 


22.9 


25.2 


1979 


23.3 


26.4 


19.4 


21.6 


1910 


20.9 


25.0 


18.4 


19.9 


191! 


22.5 


25.4 


22.1 


25.0 


I9S2 


21.2 


24.6 


23.6 


26.4 


I9S3 


22.8 


26.1 


23.3 


25.4 


1914 


25.6 


28.3 


22.3 


24.4 



Number of candidates: Approximately II x 25.000 Fi. =275.000 

II x 1.500 Sw = 16.500 

For a discussion of the consistently greater difference between Finns and Swedes in the 
listening comprehension test than in the reading conap rthea aton test, ace Ringbom 1978b. 

2 The frequencies of personal and possessive pronouns are highly variable, depending 
primarily not on the writer's use of language, but on his approach to the topic, and the 
words he and /. at least one of which normally occurs in any top twenty list, have therefore 
not been included in my tables of high-frequency words. 

3 A more detailed analysis of learner language would, of course, have to distinguish between 
the different uses of some of these high-frequency words, especially rAar and to. The 
frequency of the latter interestingly shows a steady rise from poor to good proficiency level 
in both Finnish and Swedish learners. A possible explanation of this will, however, have to 
be postponed to another context. 

4 For help with various tasks in connection with this paper I am grateful to the following: 
Monica Andersson. Sonja Granbacka, Auti Haapaniemi. Scppo Hovita, Jari Lehtonen. 
Sari Monni and Gerd Nyholm. 
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